Search CORE

7 research outputs found

Predictive Coding For Animation-Based Video Compression

Author: Konuko Goluck
Lathuilière Stéphane
Valenzise Giuseppe
Publication venue
Publication date: 09/07/2023
Field of study

We address the problem of efficiently compressing video for conferencing-type applications. We build on recent approaches based on image animation, which can achieve good reconstruction quality at very low bitrate by representing face motions with a compact set of sparse keypoints. However, these methods encode video in a frame-by-frame fashion, i.e. each frame is reconstructed from a reference frame, which limits the reconstruction quality when the bandwidth is larger. Instead, we propose a predictive coding scheme which uses image animation as a predictor, and codes the residual with respect to the actual target frame. The residuals can be in turn coded in a predictive manner, thus removing efficiently temporal dependencies. Our experiments indicate a significant bitrate gain, in excess of 70% compared to the HEVC video standard and over 30% compared to VVC, on a datasetof talking-head videosComment: Accepted paper: ICIP 202

arXiv.org e-Print Archive

PREDICTIVE CODING FOR ANIMATION-BASED VIDEO COMPRESSION

Author: Konuko Goluck
Lathuilière Stéphane
Valenzise Giuseppe
Publication venue: HAL CCSD
Publication date: 08/10/2023
Field of study

Accepted paper: ICIP 2023We address the problem of efficiently compressing video for conferencing-type applications. We build on recent approaches based on image animation, which can achieve good reconstruction quality at very low bitrate by representing face motions with a compact set of sparse keypoints. However, these methods encode video in a frame-by-frame fashion, i.e., each frame is reconstructed from a reference frame, which limits the reconstruction quality when the bandwidth is larger. Instead, we propose a predictive coding scheme which uses image animation as a predictor, and codes the residual with respect to the actual target frame. The residuals can be in turn coded in a predictive manner, thus removing efficiently temporal dependencies. Our experiments indicate a significant bitrate gain, in excess of 70% compared to the HEVC video standard and over 30% compared to VVC, on a dataset of talking-head videos. Our code is available at github.com/animation-based-codecs

HAL-CentraleSupelec

A Hybrid Deep Animation Codec for Low-Bitrate Video Conferencing

Author: Konuko Goluck
Lathuilière Stéphane
Valenzise Giuseppe
Publication venue: HAL CCSD
Publication date: 16/10/2022
Field of study

International audienceDeep generative models, and particularly facial animation schemes, can be used in video conferencing applications to efficiently compress a video through a sparse set of keypoints, without the need to transmit dense motion vectors. While these schemes bring significant coding gains over conventional video codecs at low bitrates, their performance saturates quickly when the available bandwidth increases. In this paper, we propose a layered, hybrid coding scheme to overcome this limitation. Specifically, we extend a codec based on facial animation by adding an auxiliary stream consisting of a very low bitrate version of the video, obtained through a conventional video codec (e.g., HEVC). The animated and auxiliary videos are combined through a novel fusion module. Our results show consistent average BD-Rate gains in excess of-30% on a large dataset of video conferencing sequences, extending the operational range of bitrates of a facial animation codec alone

HAL-CentraleSupelec

A HYBRID DEEP ANIMATION CODEC FOR LOW-BITRATE VIDEO CONFERENCING

Author: Konuko Goluck
Lathuilière Stéphane
Valenzise Giuseppe
Publication venue: HAL CCSD
Publication date: 27/07/2022
Field of study

Ultra-low bitrate video conferencing using deep image animation

Author: Konuko Goluck
Lathuilière Stéphane
Valenzise Giuseppe
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/12/2020
Field of study

International audienceIn this work we propose a novel deep learning approach for ultra-low bitrate video compression for video conferencing applications. To address the shortcomings of current video compression paradigms when the available bandwidth is extremely limited, we adopt a model-based approach that employs deep neural networks to encode motion information as keypoint displacement and reconstruct the video signal at the decoder side. The overall system is trained in an end-to-end fashion minimizing a reconstruction error on the encoder output. Objective and subjective quality evaluation experiments demonstrate that the proposed approach provides an average bitrate reduction for the same visual quality of more than 80% compared to HEVC

HAL-CentraleSupelec

arXiv.org e-Print Archive

HAL Descartes

Hal-Diderot

HAL-Rennes 1